Conference voice processing method, apparatus and information memory medium therefor



(57) The invention provides an apparatus for auto- 
matically preparing the minutes of a conference. 

When the voices of statements of plural speakers
are entered into voice input means 12 in a state in which
the vocalization data are registered in vocalization
memory means 11 for each identification data of the
plural speakers, the speaker and content of statement
are recognized according to the registered data. The
recognized content of statements is edited, together
with the identification data of the speakers, by record
preparation means 14 into a predetermined form to
provide minutes data, whereby the minutes data of the
content of statements of the plural speakers can be
automatically prepared, on a real-time basis, edited in a
predetermined form.
Description

BACKGROUND OF THE INVENTION 
Field of the Invention 

[0001] The present invention relates to a conference
voice processing method for preparing minutes data, a
recording apparatus therefor, and an information memory
medium storing, as software, a program for causing a
computer to execute the various processes.

Related Background Art 

[0002] In ordinary business firms, a meeting or conference
of plural employees is an important event, and it is also
important to record the oral statements made at such a
meeting. In recent years, a conference can also be held
through communication equipment such as a telephone or a
television conference system, and it is often also
necessary to record the content of such a conference.

[0003] For recording the content of the conference or
communication mentioned above, voice recording by a tape
recorder is ordinarily utilized, as disclosed in Japanese
Patent Application Laid-Open No. 8-194492.

[0004] In business firms, it is often requested to
summarize the content of the conference or communication
as a document, and, in case recording of the statements of
the attendees of the conference is required, minutes are
prepared in the form of a document.

[0005] In case of preparing minutes of a conference, a
dedicated worker for recording the statements of the
attendees attends the conference, so that such a dedicated
worker is required for the preparation of the minutes. Also,
in case the content of a conference held by communication
has to be reduced to a document, the document is prepared
by reproducing the recorded voice, but such a process is
cumbersome and does not allow the document to be obtained
on a real-time basis.

[0006] Since voice recognition apparatus of high
performance has already been developed, it is possible to
reduce the content of the statements into text data by
utilizing such voice recognition apparatus for the
conference or communication. However, though such a method
allows the content of the statements of plural speakers to
be formed as text, it is not possible to specify the
speaker of each statement.

SUMMARY OF THE INVENTION

[0007] In consideration of the foregoing, the object of
an embodiment is to provide a conference voice processing
method capable of recording the content of statements at a
conference or a communication while specifying the speaker
of each statement, a recording apparatus therefor, and an
information memory medium storing, as software, a program
for causing a computer to execute the conference voice
processing method of the present embodiment.

[0008] The above-mentioned object can be attained,
according to a first embodiment of the present invention,
by a conference voice recording apparatus comprising
pronunciation memory means in which pronunciation data are
registered for each identification data of plural speakers;
voice input means for entering the voices of statements of
the plural speakers; voice recognition means for
recognizing the speaker and the content of statement
thereof, entered into the voice input means, based on the
pronunciation data and the identification data registered
in the pronunciation memory means; and record preparation
(formation) means for preparing (forming) minutes data of
the content of statement recognized by the voice
recognition means, edited in a predetermined form together
with the identification data of the speakers.

[0009] Consequently, according to the conference voice
processing method utilizing the conference voice recording
apparatus of the embodiment, when the voice data of the
statements of the plural speakers are entered into the
voice input means in a state in which the pronunciation
data are registered in the pronunciation memory means for
each identification data of the plural speakers, the voice
recognition means recognizes the speaker and the content of
statement of the entered voice data, based on the
pronunciation data and the identification data registered
in the pronunciation memory means. As the record
preparation means prepares the thus recognized content of
statement into minutes data edited into a predetermined
form together with the identification data of the speaker,
minutes data of the content of statements of the plural
speakers, edited in a predetermined form together with the
identification data, are automatically prepared on a
real-time basis.
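
By way of illustration only, the flow of the first embodiment
may be sketched in Python as follows. The names MinutesEntry,
prepare_minutes, identify and transcribe are hypothetical and
do not appear in this specification; the speaker recognition
and content recognition steps are passed in as functions
because no particular recognition algorithm is prescribed.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class MinutesEntry:
    speaker_id: str  # identification data of the speaker
    text: str        # recognized content of the statement


def prepare_minutes(
    utterances: List[bytes],
    registry: Dict[str, bytes],
    identify: Callable[[bytes, Dict[str, bytes]], str],
    transcribe: Callable[[bytes, bytes], str],
) -> List[MinutesEntry]:
    """Recognize speaker and content for each utterance, in input order."""
    minutes = []
    for voice in utterances:
        speaker_id = identify(voice, registry)          # who made the statement
        text = transcribe(voice, registry[speaker_id])  # what was stated
        minutes.append(MinutesEntry(speaker_id, text))
    return minutes
```

The registry argument stands in for the pronunciation memory
means: a mapping from identification data to the pronunciation
data registered for that speaker.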

[0010] According to a second embodiment of the present
invention, there is provided a conference voice recording
apparatus comprising pronunciation memory means in which
pronunciation data are registered for each identification
data of plural speakers; voice input means for entering
data of the voices of statements of the plural speakers;
identification input means for successively entering the
identification data of the speakers making statements;
pronunciation selection means for selecting the
pronunciation data of the corresponding speaker from the
pronunciation memory means, corresponding to the
identification data entered into the identification input
means; voice recognition means for recognizing the content
of statement of the voice data entered into the voice input
means, based on the pronunciation data selected by the
pronunciation selection means; and record preparation means
for preparing minutes data of the content of statement
recognized by the voice recognition means, edited in a
predetermined form together with the identification data
entered into the identification input means.

[0011] Consequently, according to the conference voice
processing method utilizing the conference voice recording
apparatus of the present embodiment, in a state in which
the pronunciation data are registered in the pronunciation
memory means for each identification data of the plural
speakers, simultaneously with the entry of the voice data
of the statements of the plural speakers into the voice
input means, the identification data of the speakers making
statements are entered in succession from the
identification input means. Then, corresponding to the thus
entered identification data, the pronunciation selection
means selects the pronunciation data of the corresponding
speaker from the pronunciation memory means, and, based on
the thus selected pronunciation data, the voice recognition
means recognizes the content of statement of the entered
voice data. As the record preparation means prepares the
thus recognized content of statement into minutes data
edited into a predetermined form together with the
identification data of the speaker, minutes data of the
content of statements of the plural speakers, edited in a
predetermined form together with the identification data,
are automatically prepared on a real-time basis.
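
A minimal sketch of the second embodiment follows, under the
assumption that each statement arrives already paired with the
identification data entered for it. The function name
minutes_with_entered_ids and the transcribe callable are
hypothetical; the point shown is only that the entered
identification data, rather than the voice itself, selects the
pronunciation data used for recognition.

```python
from typing import Callable, Dict, Iterable, List, Tuple


def minutes_with_entered_ids(
    stream: Iterable[Tuple[str, bytes]],
    registry: Dict[str, bytes],
    transcribe: Callable[[bytes, bytes], str],
) -> List[Tuple[str, str]]:
    """Recognize each statement using the data selected by the entered ID."""
    minutes = []
    for speaker_id, voice in stream:
        if speaker_id not in registry:
            raise KeyError(f"no pronunciation data registered for {speaker_id!r}")
        selected = registry[speaker_id]  # pronunciation selection step
        minutes.append((speaker_id, transcribe(voice, selected)))
    return minutes
```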

[0012] According to a third embodiment of the present
invention, there is provided a conference voice recording
apparatus comprising pronunciation memory means in which
pronunciation data are registered for each identification
data of plural speakers; plural voice input means for
individually entering the data of the voices of statements
of the plural speakers; voice recognition means for
recognizing the content of statement of the voice data
entered into the plural voice input means, based on the
pronunciation data of the corresponding speaker in the
pronunciation memory means; and record preparation means
for preparing minutes data of the content of statement
recognized by the voice recognition means, edited in a
predetermined form together with the identification data of
the speaker.

[0013] Consequently, according to the conference voice
processing method utilizing the conference voice recording
apparatus of the present embodiment, in a state in which
the pronunciation data are registered in the pronunciation
memory means for each identification data of the plural
speakers, the voice data of the statements of the plural
speakers are entered individually into the plural voice
input means, and the voice recognition means recognizes the
content of statement of the voice data entered into the
plural voice input means, based on the pronunciation data
of the corresponding speaker in the pronunciation memory
means. As the record preparation means prepares the thus
recognized content of statement into minutes data edited
into a predetermined form together with the identification
data of the speaker, minutes data of the content of
statements of the plural speakers, edited in a
predetermined form together with the identification data,
are automatically prepared on a real-time basis.

[0014] In a fourth embodiment, there is provided a data
communication apparatus for executing data communication by
plural speakers, the apparatus comprising the conference
voice recording apparatus of the present invention, wherein
the pronunciation memory means registers pronunciation data
for each identification data of plural speakers convening
by data communication, and the record preparation means is
adapted to prepare the minutes data from the voice of
statements of the plural speakers in the data
communication.

[0015] Consequently, according to the conference voice
processing method utilizing the conference voice recording
apparatus of the present embodiment, when plural speakers
convene through data communication, minutes data of the
content of statements in the conference by the data
communication, edited in a predetermined form together with
the identification data, are automatically prepared on a
real-time basis.

[0016] In the above-described conference voice recording
apparatus, there may also be provided image display means
for displaying the above-mentioned minutes data; correction
input means for entering correction data on the minutes
data displayed by the image display means; and correction
execution means for correcting the minutes data,
corresponding to the input data of the correction input
means.

[0017] In such case, in a state in which the image
display means displays the minutes data, the correction on
the thus displayed minutes data is entered by the
correction input means. In response to the input data, the
correction execution means corrects the minutes data,
whereby the minutes data are appropriately corrected in
the course of display.
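
As a hedged illustration of the correction execution step,
the following Python sketch replaces the text of one displayed
statement while leaving the speaker attribution untouched. The
representation of minutes data as a list of (identification
data, text) pairs and the name apply_correction are
assumptions made for this example only.

```python
from typing import List, Tuple

Entry = Tuple[str, str]  # (speaker identification data, statement text)


def apply_correction(minutes: List[Entry], index: int, new_text: str) -> List[Entry]:
    """Return a copy of the minutes with one statement's text replaced."""
    if not 0 <= index < len(minutes):
        raise IndexError(f"no minutes entry at position {index}")
    corrected = list(minutes)
    speaker_id, _ = corrected[index]
    corrected[index] = (speaker_id, new_text)  # speaker attribution is kept
    return corrected
```

Returning a copy rather than editing in place is a design
choice of the sketch; it lets the display show the corrected
data while the uncorrected data remain available.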

[0018] In the above-described conference voice recording
apparatus, there may also be provided image display means
for displaying the above-mentioned minutes data; approval
input means for entering an approval for the minutes data
displayed on the image display means, for each
identification data of the speaker; and approval addition
means for adding an approval, entered into the approval
input means, together with the identification data of the
speaker to the minutes data.

[0019] In such case, in a state in which the image 
display means displays the minutes data, the approval 
for thus displayed minutes data is entered by the 
approval input means. In response to the input 
approval, the approval addition means adds the entered 
approval to the minutes data together with the identifica- 
tion data of the speaker, whereby the approval by the 
speaker is also recorded in the minutes data. 

[0020] In the above-described conference voice recording
apparatus, the approval input means can also enter the
approval for the minutes data as voice data by the
aforementioned voice input means. In such case, as the
approval input means enters the approval for the minutes
data in the form of voice data by the voice input means,
the approval is entered for each speaker by the voice data
specifying the speaker.

[0021] In the above-described conference voice recording
apparatus, there may also be provided record printing means
for printing the minutes data. In such case the record
printing means prints the minutes data, whereby a document
is printed that records the content of statements of the
plural speakers together with the identification data of
the speakers.
[0022] In the above-described conference voice 
recording apparatus, there may also be provided time 
detection means for detecting the current time at the 
entry of the voice of the speaker into the voice input 
means, and the record preparation means may prepare 
the minutes data, utilizing also the time detected by the 
time detection means. 

[0023] In such case the time detection means detects the
current time when the voice of statement of the speaker is
entered into the voice input means, and the record
preparation means prepares the minutes data utilizing the
detected time, whereby the prepared minutes data also
record the time of statement by the speaker.
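
The use of the detected time can be illustrated with a short
Python sketch. The names record_statement and format_line, the
injectable clock argument now, and the bracketed HH:MM:SS line
layout are all assumptions of this example; the specification
does not define how the time is written into the minutes data.

```python
from datetime import datetime
from typing import Callable, List, Tuple

TimedEntry = Tuple[datetime, str, str]  # (time of entry, speaker ID, text)


def record_statement(
    minutes: List[TimedEntry],
    speaker_id: str,
    text: str,
    now: Callable[[], datetime] = datetime.now,
) -> None:
    """Append a statement stamped with the time returned by the clock."""
    minutes.append((now(), speaker_id, text))


def format_line(entry: TimedEntry) -> str:
    """Render one timed entry as a single line of minutes text."""
    stamp, speaker_id, text = entry
    return f"[{stamp:%H:%M:%S}] {speaker_id}: {text}"
```

Passing the clock in as a function keeps the sketch testable
with a fixed time while defaulting to the system clock.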

[0024] Various means in the present invention may be so
formed as to realize the corresponding functions, and
include, for example, dedicated hardware for a
predetermined function, a computer to which a predetermined
function is provided by a program, a predetermined function
realized by a program in a computer, and combinations
thereof.

[0025] For example, the pronunciation memory 
means is only required to register the pronunciation 
data for each identification data of the plural speakers, 
and includes a memory area of an information memory 
medium such as a RAM (random access memory). The 
voice input means is only required to enter the voice of 
statement, and includes a microphone for outputting a 
signal corresponding to the voice input, a tape recorder 
for reproducing recorded data, a communication I/F for
receiving voice data etc.

[0026] The image display means is only required to 
display the minutes data, and includes, for example, a 
display device such as a CRT (cathode ray tube) or a 
LCD (liquid crystal display) device. The record printing 
means is only required to print the minutes data, and 
includes, for example, a printing device such as a laser 
printer or a bubble jet printer. 

[0027] The correction input means is only required
to enter a correction on the minutes data, and includes,
for example, a manually operated keyboard. The
approval input means is only required to enter the 
approval for the minutes data, for each identification 
data of the speakers, and includes, for example, a voice 
recognition apparatus composed of a microphone and a 
computer. 

[0028] A first information memory medium of the present
embodiment stores software readable by a computer connected
with pronunciation memory means for registering
pronunciation data for each identification data of the
plural speakers and voice input means into which the voice
data of statements of plural speakers are entered;
wherein the memory medium stores a program for causing the
computer to execute recognition of the speaker and the
content of statement of the voice data entered into the
voice input means, based on the pronunciation data and
identification data registered in the pronunciation memory
means, and preparation of minutes data of the recognized
content of statements, edited in a predetermined form
together with the identification data of the speakers.

[0029] Consequently, when the program stored in the
information memory medium of the present embodiment is read
by the computer to execute the corresponding process, the
computer recognizes the speaker and the content of
statements of the voice data entered into the voice input
means, based on the pronunciation data and identification
data registered in the pronunciation memory means, and
prepares the minutes data of the recognized content of the
statements, edited in a predetermined form together with
the identification data of the speakers. Therefore, when
plural speakers make statements, the minutes data of such
statements, edited in a predetermined form together with
the identification data, are automatically prepared on a
real-time basis.

[0030] A second information memory medium of a second
embodiment stores software readable by a computer connected
with pronunciation memory means for registering
pronunciation data for each identification data of the
plural speakers and voice input means into which the voice
data of statements of plural speakers are entered;
wherein the memory medium stores a program for causing the
computer to execute reception of input of the
identification data of the speaker making a statement;
selection of the pronunciation data of the corresponding
speaker from the pronunciation memory means, corresponding
to the entered identification data; recognition of the
content of statement of the voice data entered into the
voice input means, based on the selected pronunciation
data; and preparation of minutes data of the recognized
content of statement, edited in a predetermined form
together with the identification data.

[0031] Consequently, when the program stored in the
information memory medium of the present embodiment is read
by the computer to execute the corresponding process, the
computer receives the input of the identification data of
the speaker making a statement, and selects the
pronunciation data of the corresponding speaker from the
pronunciation memory means, corresponding to the entered
identification data. It recognizes the content of statement
of the voice data entered into the voice input means, based
on the selected pronunciation data, and prepares the
minutes data of the recognized content of the statement,
edited in a predetermined form together with the
identification data of the speakers. Therefore, when plural
speakers make statements, the minutes data of such
statements, edited in a predetermined form together with
the identification data, are automatically prepared on a
real-time basis.

[0032] A third information memory medium of a third
embodiment stores software readable by a computer connected
with pronunciation memory means for registering
pronunciation data for each identification data of the
plural speakers and plural voice input means into which the
statements of plural speakers are individually entered;
wherein the memory medium stores a program for causing the
computer to execute recognition of the content of statement
of the voice data entered into the voice input means, based
on the pronunciation data and identification data
registered in the pronunciation memory means, and
preparation of minutes data of the recognized content of
statements, edited in a predetermined form together with
the identification data of the speakers.

[0033] Consequently, when the program stored in the
information memory medium of the present embodiment is read
by the computer to execute the corresponding process, the
computer recognizes the content of statements of the voice
data entered into the plural voice input means, based on
the pronunciation data and identification data registered
in the pronunciation memory means, and prepares the minutes
data of the recognized content of the statements, edited in
a predetermined form together with the identification data
of the speakers. Therefore, when plural speakers make
statements, the minutes data of such statements, edited in
a predetermined form together with the identification data,
are automatically prepared on a real-time basis.

[0034] The information memory medium in the present
embodiment can be any hardware storing in advance, as
software, a program for causing a computer to execute
various processes, and includes, for example, a ROM (read
only memory) or an HDD (hard disk drive) fixed to an
apparatus of which the computer constitutes a part, or a
CD-ROM or an FD (floppy disk) detachably attached to an
apparatus of which the computer constitutes a part.

[0035] Also the computer in the above embodiments can be
any apparatus capable of reading a program consisting of
software and executing a corresponding process, and
includes, for example, an apparatus principally composed of
a CPU (central processing unit) and additionally provided
with various devices such as a ROM, a RAM, an I/F etc.
as necessary.



BRIEF DESCRIPTION OF THE DRAWINGS 
[0036] 

Fig. 1 is a schematic view showing the logical
configuration of a conference voice recording apparatus
embodying the present invention;
Fig. 2 is a block diagram showing the physical
configuration thereof;
Fig. 3 is a schematic view showing a state in which
the conference voice recording apparatus embodying
the present invention is used in a conference;
Fig. 4 is a flow chart showing a subroutine of
initialization in a conference voice processing method in
the conference voice recording apparatus;
Fig. 5 is a flow chart showing a main routine of the
conference voice processing method;
Fig. 6 is a flow chart showing a subroutine of a post
process of the conference voice processing method;
Fig. 7 is a plan view showing a printed image of the
minutes data; and
Fig. 8 is a chart of a flow upon using a conference
room provided with a voice-character conversion
output system explained above.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

[0037] In the following, an embodiment of the present
invention will be explained with reference to the attached
drawings. In the following, however, portions of the
embodiment identical with those in the aforementioned
conventional example will be represented by the same names
and their explanation will be omitted. Fig. 1 is a
schematic view showing the logical configuration of a
conference voice recording apparatus of the present
embodiment; Fig. 2 is a block diagram showing the physical
configuration; Fig. 3 is a schematic view showing a state
in which the conference voice recording apparatus of the
present embodiment is used in a conference; Fig. 4 is a
flow chart showing the initialization in a conference voice
processing method of the conference voice recording
apparatus; Fig. 5 is a flow chart showing a main routine of
the conference voice processing method; Fig. 6 is a flow
chart showing a subroutine of a post process in the
conference voice processing method; Fig. 7 is a plan view
showing a printed image of the minutes data; and Fig. 8 is
a chart of a flow upon using a conference room provided
with a voice-character conversion output system explained
above.

[0038] A minutes generation system 1 constituted by the
conference voice recording apparatus of the present
invention is provided, as shown in Fig. 2, with a computer
system 100 consisting of an ordinary personal computer, and
such computer system 100 is provided, as the principal
hardware of the computer, with a CPU 101.

[0039] The CPU 101 is connected, through a bus line 102,
to hardware such as a ROM 103, a RAM 104, an HDD 105, an
FDD (FD drive) 107 in which an FD 106 is detachably
mounted, a CD drive 109 in which a CD-ROM 108 is detachably
mounted, a keyboard 110, a mouse 111, a display 112, a USB
(universal serial bus)-I/F 113 etc.

[0040] The USB-I/F 113 is connected, by a connector 114,
to a microphone unit 115, a card reader 117 in which an IC
(integrated circuit) card 116 is detachably mounted, a
laser printer 118 and a large-sized display 119.

[0041] The minutes generation system 1 of the present
embodiment is to be used in a conference attended by plural
speakers 2 as shown in Fig. 3, and is so constructed as to
prepare (form) the minutes data of such conference.
Therefore the microphone unit 115 is designed to be
omnidirectional, thus receiving voice data from the entire
circumference and outputting a signal to the connector 114.
The IC card 116 is prepared and carried by each of the
plural speakers 2 attending the conference, and registers
the identification data of the speaker and the basic
vocalization data of the speaker, for example of the basic
syllabaries.

[0042] In the minutes generation system 1 of the present
embodiment, hardware such as the ROM 103, RAM 104, HDD 105,
FD 106, CD-ROM 108, IC card 116 etc. constitutes the
information memory media, and the control programs and
various data required for various functions of the computer
system 100 are stored as software in at least one of such
memory media.

[0043] For example, the control program for causing 
the CPU 101 of the computer system 100 to execute 
various process operations is stored in advance in the 
FD 106 or the CD-ROM 108. Such software is installed 
in advance in the HDD 105, and is copied to the RAM
104 and read by the CPU 101 at the start of the computer
system 100.

[0044] Through reading of an appropriate program and
execution of various process operations by the CPU 101 of
the computer system 100, various means shown in Fig. 1 are
logically realized in the minutes generation system 1 of
the present invention, such as vocalization memory means
11, voice input means 12, voice recognition means 13,
record preparation (formation) means 14, image display
means 15, correction input means 16, correction execution
means 17, approval input means 18, approval addition means
19, record printing means 20 etc.

[0045] The vocalization memory means 11 corresponds to a
memory area constructed for example in the HDD 105 so as to
be recognizable by the CPU 101 corresponding to the control
program stored for example in the RAM 104, and registers
the vocalization data for each identification data of the
plural speakers.

[0046] More specifically, in the minutes generation
system 1 of the present embodiment, the IC card 116
prepared and carried by each of the speakers 2 attending
the conference registers identification data and
vocalization data as explained in the foregoing, and the
various data registered in the IC card 116 are copied by
the card reader 117 into the HDD 105 immediately before the
start of the conference.

[0047] The voice input means 12 corresponds to a
function of the CPU 101 for executing data processing of
the voice input from the microphone unit 115 corresponding
to the control program stored for example in the RAM 104,
and receives the input of the voice data by statements of
the plural speakers 2.

[0048] The voice recognition means 13 corresponds to a
function of the CPU 101 for executing predetermined data
processing corresponding to the control program stored for
example in the RAM 104, and recognizes the speaker 2 and
the content of statement of the speaker 2 of the voice data
entered into the voice input means 12, based on the
vocalization data and identification data of the
vocalization memory means 11.

[0049] More specifically, the voice recognition means 13
causes the RAM 104 or the like to temporarily store a
series of the voice data entered in succession in time into
the voice input means 12 and specifies the identification
data of the speaker 2, based on the vocalization data of
the plural speakers in the vocalization memory means 11.
When the speaker 2 is thus specified, it reads the
corresponding vocalization data from the vocalization
memory means 11 and executes voice recognition utilizing
such vocalization data, thereby converting the content of
statement of the voice data, entered from the voice input
means 12 and temporarily stored, into text data.
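
The speaker-specifying step can be sketched in Python under a
strong simplifying assumption: that the buffered voice data
and each speaker's registered vocalization data can both be
reduced to numeric feature vectors, and that the registered
vector nearest to the average of the buffered frames
identifies the speaker. The specification does not state how
the comparison is made, so the nearest-vector rule and the
name identify_from_buffer are illustrative only.

```python
import math
from typing import Dict, List, Sequence


def identify_from_buffer(
    frames: List[Sequence[float]],
    registry: Dict[str, Sequence[float]],
) -> str:
    """Return the ID whose registered vector is nearest the buffered mean."""
    if not frames or not registry:
        raise ValueError("need buffered voice data and registered vocalization data")
    n = len(frames)
    # Average the temporarily stored frames, one value per feature dimension.
    mean = [sum(column) / n for column in zip(*frames)]
    # Pick the speaker with the smallest Euclidean distance to that mean.
    return min(registry, key=lambda sid: math.dist(mean, registry[sid]))
```

Once the identification data is returned, the corresponding
vocalization data would be read out and used for converting
the buffered statement into text.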

[0050] The record preparation means 14 corresponds to a
function of the CPU 101 for executing predetermined data
processing, corresponding to the control program stored for
example in the RAM 104, and prepares minutes data of the
content of statement recognized by the voice recognition
means 13, edited in a predetermined form together with the
identification data of the speaker 2.

[0051] The minutes data are prepared in a form, for
example as shown in Fig. 7, recording the text data of the
content of statements at the conference in the sequential
order of such statements and the names of the speakers 2 of
such statements, together with the title of the subject of
the conference, the location and date thereof, and a list
of the names of the speakers 2 attending the conference and
the identification data of the speakers.
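
As a rough illustration of editing the recognized statements
into a predetermined form, the following Python sketch renders
the items listed above as plain text. The function name
render_minutes, the field labels and the indentation are
assumptions of this example and are not taken from Fig. 7.

```python
from typing import Dict, List, Tuple


def render_minutes(
    title: str,
    location: str,
    date: str,
    attendees: Dict[str, str],          # identification data -> speaker name
    statements: List[Tuple[str, str]],  # (identification data, text), in order
) -> str:
    """Lay out header, attendee list and statements as plain text."""
    lines = [
        f"Subject: {title}",
        f"Location: {location}",
        f"Date: {date}",
        "Attendees:",
    ]
    lines += [f"  {name} ({sid})" for sid, name in attendees.items()]
    lines.append("Statements:")
    # Statements keep their sequential order; names come from the attendee list.
    lines += [f"  {attendees[sid]}: {text}" for sid, text in statements]
    return "\n".join(lines)
```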

[0052] The image display means 15 corresponds to a
function of the CPU 101 for controlling the display on the
displays 112, 119, corresponding to the control program
stored for example in the RAM 104, and displays the minutes
data prepared by the record preparation means 14 to the
plural speakers 2 attending the conference.

[0053] The correction input means 16 corresponds to a
function of the CPU 101 for processing the input operation
of the keyboard 110, corresponding to the control program
stored for example in the RAM 104, and receives input of
correction data for the minutes data displayed by the image
display means 15.

[0054] The correction execution means 17 corresponds to a
function of the CPU 101 for executing predetermined data
processing, corresponding to the control program stored for
example in the RAM 104, and corrects the minutes data
corresponding to the input data of the correction input
means 16. The aforementioned image display means 15 is also
capable of displaying, on a real-time basis, the minutes
data corrected by the correction execution means 17.

[0055] The approval input means 18 corresponds to a
function of the CPU 101 for processing the voice input of
the microphone unit 115, corresponding to the control
program stored for example in the RAM 104, and enters an
approval for the minutes data displayed by the image
display means 15, for each identification data of the
speakers 2.

[0056] The approval addition means 19 corresponds to a
function of the CPU 101 for executing predetermined data
processing, corresponding to the control program stored for
example in the RAM 104, and adds the approval, entered for
example by voice data, to the minutes data, together with
the identification data of the speakers 2.

[0057] More specifically, the approval input means 18
displays, for example on the display 119, the names of the
speakers 2 in succession, together with a predetermined
guidance text such as "Do you approve the minutes?", and,
when the speaker 2 whose name is displayed enters a
statement such as "Yes" into the microphone unit 115, the
speaker 2 of the thus entered voice data is confirmed and
an approval mark is attached in a corresponding position of
the list of the speakers 2 in the minutes data.

[0058] The record printing means 20 corresponds
to a function of the CPU 101 for controlling the data
printing of the laser printer 118, corresponding to the
control program stored for example in the RAM 104, and
executes printing on a printing sheet 3 in the laser
printer 118 as shown in Fig. 7.
[0059] Prior to the printing of the minutes data by
the laser printer 118 serving as the record printing
means 20 or the display of the minutes data by the displays
112, 119 serving as the image display means 15,
the minutes data may be stored in the HDD 105 or the
FD 106 as preliminary data processing.
[0060] The above-mentioned various means are
realized utilizing hardware such as the keyboard 110
or the display 112 according to the necessity, but are
principally realized by the function of the CPU 101,
which is computer hardware, corresponding to the
software stored in the information memory medium
such as the RAM 104.

[0061] Such software is stored in the information
memory medium such as the RAM 104 as a control program
for causing the CPU 101 to execute various process
operations such as registration of the vocalization
data for each identification data of the plural speakers,
reception of the voice data of the statements of the plural
speakers 2 through the microphone unit 115, recognition
of the speaker 2 and the content of statement of
the entered voice data based on the registered vocalization
data of the plural speakers 2, preparation of the
minutes data of the recognized content of statements,
edited in a predetermined form together with the identification
data of the speaker 2, display of the prepared
minutes data on the displays 112, 119, reception of correction
data, for example from the keyboard 110, for the
displayed minutes data, correction of the minutes data
corresponding to the thus entered data, entry of the approval
for the displayed minutes data from the microphone unit
115 for each identification data of the speakers 2, addition
of the entered approval to the minutes data together
with the identification data of the speaker 2, and printing
of the minutes data for example by the laser printer 118.
[0062] In the following there will be explained a conference
voice processing method utilizing the minutes
generation system 1 of the present embodiment, having
the above-described configuration.
As preparation for the conference, the various units of
the minutes generation system 1 are suitably positioned.
For example, as shown in Fig. 3, there is prepared
a large-sized table 4 at which the plural speakers
2 are to be seated, and an omnidirectional microphone
unit 115 is provided in a position capable of receiving
the statements of all the speakers 2.

[0063] Also a large-sized display 119 is provided in
a position observable by all the speakers 2, then a keyboard
110 of the computer system 100 is provided in a
position of a speaker 2 serving as the chairman, and a
card reader 117 is provided close to the entrance (not
shown) of the conference room.

[0064] In such state, each of the plural speakers 2
attending the conference carries the IC card 116 in which the
identification data of the speaker himself and the vocalization
data thereof are registered, and, upon entering
the conference room, sets the IC card 116 in the card reader
117 as shown in Fig. 4.

[0065] Then the minutes generation system 1
reads, by the card reader 117, the vocalization data and
identification data registered in the IC card 116 and copies
such data in the HDD 105 (steps S1, S4), and the
chairman, confirming the data input of all the plural
speakers 2 attending the conference, enters the completion
of the initialization through the keyboard 110
(step S3, T1).

[0066] However the speaker 2 attending the conference
may forget to carry the IC card 116 or may forget
the data registration therein. In such case, therefore, the
minutes generation system 1 of the present embodiment
is capable of assuming a registration mode by an
input operation on the keyboard 110 (step S2), whereupon
the identification data of the speaker 2 can be
entered by the keyboard 110 and the voice data generated
by the speaker 2 can be registered through the
microphone unit 115 (step S5).
[0067] When the voice data of all the plural speakers
2 attending the conference are stored, together with
the identification data, in the minutes generation system
1 as explained in the foregoing, the system 1, recognizing
the completion of initialization, enters a state capable
of automatically generating the minutes of the
conference (step T1).
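The initialization described above (steps S1 to S5 and T1) can be sketched as follows. This is only an illustration under stated assumptions, not the control program of the embodiment: the class `VocalizationStore`, its method names, the sample identifiers, and the representation of vocalization data as a byte string are all hypothetical, and readiness is modelled simply as every expected attendee having registered data.

```python
from dataclasses import dataclass, field


@dataclass
class VocalizationStore:
    """Hypothetical stand-in for the vocalization memory means 11."""
    expected: set                      # identification data of the attendees
    data: dict = field(default_factory=dict)

    def register_from_card(self, card: dict) -> None:
        # Steps S1, S4: copy the data read from the IC card into the store.
        self.data[card["id"]] = card["vocalization"]

    def register_manually(self, speaker_id: str, voice_sample: bytes) -> None:
        # Steps S2, S5: fallback registration by keyboard and microphone.
        self.data[speaker_id] = voice_sample

    def initialized(self) -> bool:
        # Step T1 (assumed rule): ready once every attendee has data.
        return self.expected.issubset(self.data)


store = VocalizationStore(expected={"A01", "B02"})
store.register_from_card({"id": "A01", "vocalization": b"\x01\x02"})
ready_before = store.initialized()      # B02 forgot the IC card
store.register_manually("B02", b"\x03\x04")
ready_after = store.initialized()
```

Keeping the card path and the manual path behind the same store means the later recognition steps need not know how a speaker was registered.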

[0068] When the conference is initiated by the plural 
speakers 2 in response to the declaration by the chair- 
man, the minutes generation system 1 receives the 
voice data of the statement entered from the speaker 2 
through the microphone unit 115 (step T2) and executes 
voice recognition based on the voice data of all the 
speakers 2 stored temporarily in the HDD 105 (step T3). 
[0069] It issues identification data by specifying the 
speaker 2 of the statement (steps T4, T5), then exe- 
cutes data conversion on the content of the statement 
(steps T6, T7) and stores the text data of the content of 
statement for example in the RAM 104, together with 
the identification data of the speaker 2 (step T8). 
[0070] In case the speaker 2 making the statement
cannot be specified, dedicated "incapable" data indicating
the failure are outputted (step T9), and dedicated
incapable data are outputted also in case the content of
the statement cannot be recognized (step T10). These
incapable data are also stored, together with the ordinary
data, for example in the RAM 104 (step T8).

[0071] The minutes generation system 1 of the
present embodiment executes the above-described
serial process (steps T2 - T10) for each statement of the
speaker 2, so that the content of the statements at the
conference is time-sequentially recorded together with
the identification data of the speakers 2. When the conference
is terminated, the chairman manipulates the
keyboard 110 to shift the minutes generation system 1
to the execution of a post process (steps T11, T12).
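The per-statement process of steps T2 to T10 can be sketched as below. This is a hedged illustration only: the function `record_statements`, the two placeholder strings, and the toy recognizers are assumptions introduced here, and real speaker identification and voice recognition are replaced by dictionary lookups.

```python
from typing import Callable, Optional

UNKNOWN_SPEAKER = "<speaker not recognizable>"   # stands in for step T9 data
UNKNOWN_TEXT = "<not recognizable>"              # stands in for step T10 data


def record_statements(
    utterances: list,
    identify: Callable[[bytes], Optional[str]],
    transcribe: Callable[[bytes], Optional[str]],
) -> list:
    """Return (speaker id, text) pairs in the order the statements were made."""
    minutes = []
    for voice in utterances:              # step T2: receive the voice data
        speaker = identify(voice)         # steps T3 to T5: specify the speaker
        text = transcribe(voice)          # steps T6, T7: convert to text
        minutes.append((                  # step T8: store ordinary or incapable data
            speaker if speaker is not None else UNKNOWN_SPEAKER,
            text if text is not None else UNKNOWN_TEXT,
        ))
    return minutes


# Toy recognizers keyed on the raw bytes, purely for demonstration.
speakers = {b"v1": "A01", b"v2": "B02"}
texts = {b"v1": "Let us begin.", b"v3": "I agree."}
log = record_statements([b"v1", b"v2", b"v3"], speakers.get, texts.get)
```

Storing a placeholder instead of dropping an unrecognized statement preserves its position in the time sequence, which is what later allows a blank to be shown and filled in by the chairman.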
[0072] In such state, in response to the input of a
document form by the chairman into the keyboard 110
(step E1), the minutes data in the RAM 104 are edited
into a predetermined form (step E2). The minutes data
thus prepared are displayed by the displays 112, 119 to
all the plural speakers 2 (step E3), whereby all the
speakers can confirm the content of the minutes.
[0073] The displayed minutes data, having a layout
similar to that of the printed result, contain the sequential
text data of the content of statements at the conference
together with the names of the speakers 2, as well
as the title of subject of the conference, the location and
date of the conference, and a list of the names and identification
data of the speakers 2 attending the conference,
as shown in Fig. 7.
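One possible way to assemble the items listed above into a single document is sketched here. The layout of Fig. 7 is not reproduced; the function `format_minutes`, the field labels, and the sample names, place and date are all assumptions made for illustration.

```python
def format_minutes(title, location, date, attendees, statements):
    """Build a plain-text minutes document.

    attendees:  {identification data: name}
    statements: [(identification data, text)] in the order spoken
    """
    lines = [
        f"Subject: {title}",
        f"Location: {location}",
        f"Date: {date}",
        "Attendees:",
    ]
    lines += [f"  {name} ({sid})" for sid, name in attendees.items()]
    lines.append("Statements:")
    # Fall back to the raw identifier when no name is registered for it.
    lines += [f"  {attendees.get(sid, sid)}: {text}" for sid, text in statements]
    return "\n".join(lines)


text = format_minutes(
    "Budget review", "Room 3", "2000-01-01",
    {"A01": "Sato", "B02": "Ito"},
    [("A01", "Let us begin."), ("B02", "Agreed.")],
)
```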

[0074] In this state, in the position of the content of
statement or the speaker 2 that cannot be recognized,
there is displayed a blank corresponding to the time of
such statement, together with a guidance message
such as "not recognizable". Therefore, the chairman
enters appropriate text data by the keyboard 110, with
the consent of all the speakers (steps E2, E4). Also the
result of voice recognition may be erroneous, and, in
such case, the chairman corrects the text data by the
keyboard 110, again with the consent of all the speakers.

[0075] The minutes data thus corrected are also
displayed on real time basis (step E3), so that, upon
completion of the correction, there is assumed a state of
receiving the approvals by all the speakers by an input
operation on the keyboard 110. In this state the names
of the speakers 2 are displayed in succession on the
display 119, together with a predetermined guidance
text such as "Do you approve the minutes?", and the
speaker 2 whose name is displayed enters a statement
such as "Yes" into the microphone unit 115 in case of
approving the content of the minutes (step E5).
[0076] The minutes generation system 1 confirms
that the speaker 2 of such voice data coincides with the
speaker 2 displayed on the display 119, and, upon such
confirmation, displays an approval mark "O" in a corresponding
position in the speaker list of the minutes data
(step E6).
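The approval sequence of steps E5 and E6 can be sketched as follows. The sketch assumes hypothetical `identify` and `transcribe` callables in place of the actual voice recognition, and the function name `collect_approvals` and the sample data are likewise introduced here only for illustration.

```python
def collect_approvals(attendees, answers, identify, transcribe):
    """Return {identification data: approval mark} for the speaker list.

    attendees: identification data in the order the names are displayed
    answers:   {identification data: voice data entered while that name is shown}
    """
    marks = {}
    for sid in attendees:
        voice = answers.get(sid)
        approved = (
            voice is not None
            and identify(voice) == sid                        # voice matches displayed name
            and (transcribe(voice) or "").strip().lower() == "yes"
        )
        marks[sid] = "O" if approved else ""
    return marks


# Toy recognizers: the answer given for C03 is identified as A01's voice.
who = {b"a": "A01", b"b": "B02", b"c": "A01"}.get
what = {b"a": "Yes", b"b": "No", b"c": "Yes"}.get
marks = collect_approvals(
    ["A01", "B02", "C03"],
    {"A01": b"a", "B02": b"b", "C03": b"c"},
    who, what,
)
```

Checking the identified speaker against the displayed name is what lets a spoken "Yes" serve as an approval attributable to one person without a separate input device.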

[0077] When the minutes data are approved by all
the plural speakers 2, the chairman instructs, through
the keyboard 110, the minutes generation system 1 to
execute printing. In response, the minutes generation
system 1 prints, in a number of copies equal to the
number of the speakers 2, a minutes document recording
the text data of the content
of the statements with the names of the speakers 2,
together with the title of subject of the conference, the
location and date thereof, a list of the names and identification
data of the speakers 2 attending the conference,
presence/absence of approval etc. (steps E7, E8).
[0078] As explained in the foregoing, the minutes
generation system 1 of the present embodiment can
automatically generate the minutes of the conference
on real time basis and provide the printed output of such
minutes. The minutes generation system 1 of the
present embodiment is capable, by registering the voice
data of the plural speakers 2, of specifying the speaker
2 making statement and converting the content of such
statement into text data, thereby recording the text data
of the content of statement in the minutes together with
the identification data of the speaker 2, so that the
speaker, time and content of each statement can be
confirmed on the minutes.

[0079] Though the identification of the speaker 2 or
the data conversion of the content of statement thereof
by voice recognition is not yet technically complete at
present, the minutes generation system 1 of the present
embodiment displays the prepared minutes data on the
displays 112, 119 and accepts correction by the keyboard
110, so that appropriate minutes can be finally
obtained.

[0080] Besides, the system displays the prepared
minutes data through the display 110 to the speakers 2
attending the conference and adds the approvals to the
minutes data when they are approved by the speakers
2, so that the minutes data can also record the approvals
by the speakers 2. In particular, such approval by the
speaker 2 is entered by the voice input thereof, so that
the input of the approval data can be accepted in a form
capable of specifying the speaker 2, without requiring
addition of an exclusive device for the input of the
approval data.

[0081] The present invention is not limited to the
foregoing embodiment but is subject to various modifications
without leaving the scope of the invention. For
example, in the above-described embodiment, it
has been explained that the voice input means 12 is realized
by connecting the microphone unit 115 to the computer
system 100, and that the minutes data are prepared from the
voice data entered into such microphone unit 115.
[0082] However, it is possible also to realize the
voice input means by connecting a tape recorder (not
shown) instead of the microphone unit 115 to the computer
system 100, and to prepare the minutes data by
the recorded voice from such tape recorder. It is also
similarly possible to realize the voice input means by
connecting a communication device such as a modem
(not shown) instead of the microphone unit 115 to the
computer system 100 and to prepare the minutes data
by the voices transmitted and received by the communication
device.

[0083] In the foregoing embodiment, it has
been explained that the voice input means 12 is realized by
connecting an omnidirectional microphone unit 115 to
the computer system 100 and that the speaker 2 is specified
and the content of statement thereof is recognized from the
voice entered into such microphone unit 115.
[0084] However it is also possible to provide each of
the plural speakers 2 attending the conference with a
switching device constituting identification input means
(not shown), and to cause the speaker 2, making statement,
to manipulate such switching device thereby
entering the identification data of the speakers 2 making
statements in succession. In such case, the computer
system 100 serves as vocalization selection means,
thus selecting the vocalization data of the speaker 2
from the vocalization memory means 11 corresponding
to the entered identification data, and recognizes the
content of statement of the voice data entered into the
voice input means 12, based on the thus selected vocalization
data.

[0085] In such case, exclusive means is required for
specifying the speaker 2 making statement and has to
be precisely operated by the speaker 2, but it is rendered
possible to securely specify the speaker 2 making
statement without executing the voice recognition and
to achieve the voice recognition of the content of statement
with satisfactory accuracy.
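The switch-based modification can be sketched as follows. Speaker identification by voice is skipped and the identification data entered by the switching device selects the vocalization data used for recognition. The function `recognize_with_switch`, the dictionary-based "profiles", and the toy recognizer are assumptions made purely for illustration.

```python
def recognize_with_switch(events, vocalization_memory, recognize):
    """events: [(identification data entered by the switch, voice data)]."""
    minutes = []
    for speaker_id, voice in events:
        profile = vocalization_memory[speaker_id]    # vocalization selection
        minutes.append((speaker_id, recognize(voice, profile)))
    return minutes


# Toy data: the same sound is decoded differently under each speaker's profile.
memory = {
    "A01": {b"x": "hello"},
    "B02": {b"x": "yellow"},
}


def toy_recognize(voice, profile):
    # Hypothetical recognizer: a lookup in the selected speaker's profile.
    return profile.get(voice, "<not recognizable>")


minutes = recognize_with_switch([("A01", b"x"), ("B02", b"x")], memory, toy_recognize)
```

Because the profile is chosen before recognition starts, the recognizer only ever compares the input against one speaker's vocalization data, which is the source of the accuracy gain described above.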



[0086] It is also possible to respectively provide the
plural speakers 2 attending the conference with plural
microphone units of ultra high directionality to constitute
plural voice input means, and to specify the speaker 2
making statement by the microphone unit by which the
voice is entered. In such case there have to be prepared
plural microphone units, but no particular operation is
required of the speakers 2.

[0087] Such configuration of respectively providing 
the plural speakers 2 with the voice input means is 
applicable also to a data communication apparatus (not 
shown) such as a telephone system or a television tele- 
phone system in which plural speakers 2 convene by 
data communication. In the data communication appa- 
ratus utilizing voice communication, the voice data from 
the source of communication are entered and subjected 
to data transmission, and those from the destination of 
communication are received by data reception and out- 
putted in voice, so that the voice data of the speakers 2 
of the source of communication and the destination of 
communication are clearly distinguished.
[0088] In case of applying the present invention to 
such data communication apparatus, the vocalization 
data are registered for each identification data of the 
speakers 2 of the source of communication and the des- 
tination of communication, and the voice data outputted 
by data reception are recognized by the vocalization 
data of the destination of communication while those 
transmitted by the voice input are recognized by the 
vocalization data of the source of communication. 
[0089] Also in the foregoing embodiment, there has 
been explained the preparation of the minutes data, 
recording the content of statements of the plural speak- 
ers 2 and the identification data in the time-sequential 
order, but such minutes data may also be edited in vari- 
ous forms if desired. For example it is possible to record 
or delete the content of statement of a particular 
speaker 2, or to arrange the content of statements by
the speakers 2 instead of the time-sequential order, or 
to change the font or color of printing for each speaker 
2. 
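Two of the alternative editing forms mentioned above, arranging statements by speaker and deleting the statements of a particular speaker, can be sketched as below. The function names and the sample statements are hypothetical.

```python
from collections import defaultdict


def by_speaker(statements):
    """Group (speaker, text) pairs by speaker, keeping each speaker's order."""
    grouped = defaultdict(list)
    for speaker, text in statements:
        grouped[speaker].append(text)
    return dict(grouped)


def without_speaker(statements, excluded):
    """Drop the statements of one speaker, keeping the time sequence."""
    return [(s, t) for s, t in statements if s != excluded]


log = [("A01", "Let us begin."), ("B02", "Agreed."), ("A01", "Next item.")]
grouped = by_speaker(log)
filtered = without_speaker(log, "B02")
```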

[0090] It is also possible to realize time detection 
means for detecting the current time at the data input of 
the voice of the speaker for example by TOD (time of 
day clock), and to utilize the detected time in the prepa- 
ration of the minutes data. In such case, there can be 
generated more detailed minutes data, since the time of 
statement of the speaker can also be recorded therein. 
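A minimal sketch of attaching the detected time to each statement follows. The clock is passed in as a callable so that any time source can be substituted; the function `timestamped`, the time format, and the fixed sample times are assumptions introduced here, and for brevity the clock is read when each entry is processed rather than at the actual voice input.

```python
from datetime import datetime


def timestamped(entries, clock=datetime.now):
    """Prefix each (speaker, text) pair with the time read from the clock."""
    return [
        (clock().strftime("%H:%M:%S"), speaker, text)
        for speaker, text in entries
    ]


# A fixed sequence of times stands in for the time-of-day clock.
ticks = iter([datetime(2000, 1, 1, 9, 0, 0), datetime(2000, 1, 1, 9, 0, 42)])
rows = timestamped(
    [("A01", "Let us begin."), ("B02", "Agreed.")],
    clock=lambda: next(ticks),
)
```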
[0091] Furthermore, in the foregoing embodiment,
it has been explained that various means can be logically
realized as functions of the minutes generation
system 1, by the function of the CPU 101 according to
the control program stored as software for example in
the RAM 104. However it is also possible to form each
of such various means by specific hardware, or to
store a part of such various means as software in the
RAM 104 or the like and a remaining part as hardware.
[0092] Also in the foregoing embodiment, it is
assumed that the software, installed in advance from
the CD-ROM 108 or the like into the HDD 105, is copied
in the RAM 104 at the activation of the computer system
100 and that the software thus stored in the RAM 104 is
read by the CPU 101, but it is also possible to utilize
such software by the CPU 101 while such software is
stored in the HDD 105 or to fixedly store the software in
the ROM 103 in advance.

[0093] It is furthermore possible to store the software
in the FD 106 or the CD-ROM 108 which is a singly
handlable information memory medium, and to
install the software from such FD 106 or the like into the
HDD 105 or the RAM 104, and it is also possible that,
without such installation, the CPU 101 executes
processing by directly reading the software from the FD
106 or the like.

[0094] Thus, in case of realizing various means of
the minutes generation system 1 of the present invention
by software, such software is only required to be
in a state in which the CPU 101 can read the software
and can execute the corresponding operations. It is also
possible to form the control program, realizing the
above-mentioned various means, by a combination of
plural pieces of software, and, in such case, the information
memory medium constituting a single product is
required to only store the minimum software necessary
for realizing the minutes generation system 1 of the
present invention.

[0095] For example, in case of providing a computer
system 100, in which a known operating system is
installed, with application software by an information
memory medium such as the CD-ROM 108, the software
realizing various means of the minutes generation
system 1 of the present invention can be realized by a
combination of the application software and the operating
system, so that the software of a portion depending
on the operating system can be omitted from the application
software stored in the information memory
medium.

[0096] Also the method of supplying the software,
stored in the information memory medium, to the
CPU 101 is not limited to the case of directly mounting
such information memory medium in the computer system
100. For example it is also possible to store such
software in an information memory medium of a host
computer and to connect such host computer to a terminal
computer through a communication network,
thereby supplying the software by data communication
from the host computer to the terminal computer.
[0097] In such configuration, the terminal computer
can execute a stand-alone process in a state in which
the software is downloaded in the information memory
medium of the terminal computer, or can execute a
process by a real-time data communication with the
host computer, without downloading the software. In
such case, the entire system constituted by the host
computer and the terminal computer connected by the
communication network corresponds to the minutes
generation system 1 of the present invention.
[0098] Now, a flow of using a conference room
provided with the voice-character conversion output system
explained above will be explained with reference to
Fig. 8. A user who wants to use a conference room
provided with a voice-character conversion output system
contracts in advance with a service firm which
manages such a conference room, and sets an ID and
a password. Then, the user logs in to the system by using
the ID and password, and a service of the minutes generation
system is started. After a conference is completed,
a transition is made to post processes such as
transmission of the digital-signal-processed voice data. In
a case where the minutes are needed immediately, edit
processing is immediately executed at the host side of
the service firm and minutes data is transmitted to the
conference room. The user refers to the minutes data,
corrects the data if necessary and then approves
the data so that the data is established. The minutes data is
output with payment being demanded in a predetermined
manner. At this point, it is arranged not to execute
print-out in a state where the minutes data is not established.
Further, if the minutes are not needed immediately,
the above-mentioned minutes can be output
later by accessing the host of the service firm.
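The rule that print-out is not executed until the minutes data is established can be sketched as a small state check. The class `MinutesJob` and its methods are hypothetical, login and payment are omitted, and the exceptions raised are a design choice of this sketch rather than anything stated in the embodiment.

```python
class MinutesJob:
    """Hypothetical holder for minutes data received from the host side."""

    def __init__(self, text):
        self.text = text
        self.established = False

    def correct(self, new_text):
        # Corrections are accepted only before the user approves the data.
        if self.established:
            raise RuntimeError("minutes data already established")
        self.text = new_text

    def approve(self):
        # Approval by the user establishes the minutes data.
        self.established = True

    def print_out(self):
        # Print-out is refused while the data is not established.
        if not self.established:
            raise PermissionError("minutes data not yet established")
        return self.text


job = MinutesJob("A01: Let us begn.")
try:
    job.print_out()
    blocked = False
except PermissionError:
    blocked = True
job.correct("A01: Let us begin.")
job.approve()
printed = job.print_out()
```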
[0099] In this embodiment, a voice-character conversion
output system is explained by exemplifying a
minutes generation system. However, it can be appreciated
that the invention is not limited to the embodiment
and, for example, the configuration of the invention can be
changed and modified in any manner as long as it has
an arrangement for executing an output in a character
form by discriminating plural voices.

[0100] The present invention of the above-described
configuration provides the following advantages.

[0101] In the conference voice processing method
utilizing the first conference voice recording apparatus
of the present invention, when the voice data of plural
speakers are entered into the voice input means in a
state in which the vocalization data are registered for
each identification data of the plural speakers, the voice
recognition means recognizes the speaker and the content
of statement of the entered voice data, based on
the vocalization data and identification data registered
in the vocalization memory means, and the record preparation
means prepares minutes data of the recognized
content of statements, edited in a predetermined form
together with the identification data of the speakers,
whereby the minutes data of the content of statements
of the plural speakers can be automatically prepared on
real time basis, edited in a predetermined form with the
identification data, so that the minutes of the conference
can be prepared without an exclusive worker.

[0102] In the conference voice processing method
utilizing the second conference voice recording apparatus
of the present invention, when the voice data of
statements of plural speakers are entered into the voice
input means and at the same time the identification data
of the speakers making statements are entered in succession
in the identification input means in a state in
which the vocalization data are registered for each identification
data of the plural speakers, the vocalization
selection means selects the vocalization data of the corresponding
speaker from the vocalization memory
means, corresponding to the entered identification data,
then the voice recognition means recognizes the content
of statement of the entered voice data, based on
the selected vocalization data, and the record preparation
means prepares minutes data of the recognized
content of statements, edited in a predetermined form
together with the identification data of the speakers,
whereby the minutes data of the content of statements
of the plural speakers can be automatically prepared on
real time basis, edited in a predetermined form with the
identification data, so that the minutes of the conference
can be prepared without an exclusive worker. Also the
speaker making statement can be securely specified
without the voice recognition, and the accuracy of the
voice recognition of the content of statement can be
improved.

[0103] In the conference voice processing method
utilizing the third conference voice recording apparatus
of the present invention, when the voice data of statements
of plural speakers are entered respectively into
the plural voice input means in a state in which the
vocalization data are registered in the vocalization
memory means for each identification data of the plural
speakers, the voice recognition means recognizes the
content of statement of the voice data entered into the
plural voice input means, based on the vocalization data
of the corresponding speaker registered in the vocalization
memory means, and the record preparation means
prepares minutes data of the recognized content of
statements, edited in a predetermined form together
with the identification data of the speakers, whereby the
minutes data of the content of statements of the plural
speakers can be automatically prepared on real time
basis, edited in a predetermined form with the identification
data, so that the minutes of the conference can be
prepared without an exclusive worker. In particular, the
speaker making statement can be almost securely
specified without voice recognition, and the accuracy of
voice recognition of the content of statement can be
improved.

[0104] In the conference voice processing method
utilizing the fourth conference voice recording apparatus
of the present invention, when the plural speakers
convene by data communication, there are automatically
prepared, on real time basis, minutes data of the
content of statements in such conference by data communication,
whereby the minutes data of the content of
statements of the plural speakers can be automatically
prepared on real time basis, edited in a predetermined
form with the identification data.



[0105] Also in the above-described conference 
voice recording apparatus, when, in a state in which the 
image display means displays the minutes data, the cor- 
rection on the displayed minutes data is entered in the 
correction input means, the correction execution means 
corrects the minutes data in response to the input data, 
whereby the minutes data can be appropriately cor- 
rected while they are displayed, so that the minutes data 
can be prepared in an appropriate form even if the voice 
recognition is not secure. 

[0106] Also when, in a state in which the image display
means displays the minutes data, the approval for
the displayed minutes data is entered into the approval
input means for each identification data of the speakers,
the approval addition means adds the thus entered
approval to the minutes data together with the identification
data of the speaker, whereby the minutes data can
also record the approvals by the speakers.
[0107] Also by the input of the approval for the minutes
data in the form of voice data into the voice input
means, there can be entered the approval for each
speaker, by the voice data of which the speaker can be
specified.

[0108] Also the record printing means prints the
minutes data, so that there can be printed a document
recording the content of statements of the plural speakers,
together with the identification data of the speakers.
[0109] Further, the time detection means detects
the current time at the input of voice data of statement
of the speaker and the record preparation means utilizes
the detected time in preparing the minutes data,
whereby the prepared minutes data can also record, for
example, the time of statement by the speaker.

Claims 

1. A conference voice processing method comprising 
steps of: 

registering vocalization data for each identifica- 
tion data of plural speakers; 
entering voice data of statements of the plural 
speakers; 

recognizing the speaker and content of state- 
ment of the entered voice data, based on said 
vocalization data and said identification data; 
and 

forming minutes data of the recognized content
of statements, edited in a predetermined form 
together with the identification data of the 
speakers. 

2. A conference voice processing method comprising 
steps of: 

registering vocalization data for each identifica- 
tion data of plural speakers; 
entering voice data of statements of the plural
speakers individually for each thereof;
recognizing the content of statement of the 
individually entered voice data, based on said 
vocalization data and said identification data of 
the corresponding speaker; and 
forming minutes data of the recognized content 
of statements, edited in a predetermined form
together with the identification data of the 
speakers. 

3. A conference voice processing method comprising
steps of: 

registering vocalization data for each identifica- 
tion data of plural speakers; 
entering voice data of statements of the plural 
speakers; 

entering the identification data of the speakers 
making statements in succession; 
selecting the vocalization data of the speaker, 
corresponding to the entered identification
data;
recognizing the content of statement of the
entered voice data, based on the selected
vocalization data; and

forming minutes data of the recognized content 
of statements, edited in a predetermined form 
together with the identification data of the 
speakers. 

4. A conference voice processing method in a data
communication apparatus in which plural speakers 
execute data communication, the method compris- 
ing steps of: 

registering vocalization data for each identifica- 
tion data of the plural speakers convening by 
data communication; and 
forming minutes data from the voice data of 
statements of the plural speakers in data com- 
munication, by a conference voice processing 
method according to any of claims 1 to 3. 

5. A conference voice processing method according
to any of claims 1 to 4, wherein current time is
detected at the entry of the voice data of the
speaker, and the minutes data are formed utilizing
also the detected time.

6. A conference voice recording apparatus comprising:

vocalization memory means for registering
vocalization data for each identification data of
plural speakers;
voice input means for entering voice data of
statements of the plural speakers;
voice recognition means for recognizing the
speaker and content of statement of the
voice data entered into said voice input means,
based on the vocalization data and the identification
data registered in said vocalization
memory means; and
record forming means for forming minutes data
of the content of statements recognized by said
voice recognition means, edited in a predetermined
form together with the identification data
of the speakers.

7. A conference voice recording apparatus comprising:

vocalization memory means for registering
vocalization data for each identification data of
plural speakers;
voice input means for entering voice data of
statements of the plural speakers;
identification input means for successively
entering the identification data of the speakers
making statements;
vocalization selection means for selecting the
vocalization data of the speaker from said
vocalization memory means, corresponding to
the identification data entered into said identification
input means;
voice recognition means for recognizing the
content of statement of the voice data entered
into said voice input means, based on the
vocalization data selected by said vocalization
selection means; and
record preparation means for preparing minutes
data of the content of statements recognized
by said voice recognition means, edited
in a predetermined form together with the identification
data entered into the identification
input means.

8. A conference voice recording apparatus compris-
ing:

vocalization memory means for registering
vocalization data for each identification data of
plural speakers;

plural voice input means for individually enter-
ing voice data of statements of the plural
speakers;

voice recognition means for recognizing the
content of statement of the voice data entered
into the plural voice input means, based on the
vocalization data of the corresponding speaker
in said vocalization memory means; and

record preparation means for preparing min-
utes data of the content of statements recog-
nized by said voice recognition means, edited
in a predetermined form together with the iden-
tification data of the speakers.

9. A data communication apparatus for data commu-
nication of plural speakers, comprising a confer-
ence voice processing apparatus according to any 
of claims 6 to 8, wherein: 

said vocalization memory means registers 
vocalization data for each identification data of 
the plural speakers convening by data commu- 
nication; and 

said record preparation means prepares min-
utes data from the voice data of statements of
the plural speakers in data communication. 

10. A conference voice recording apparatus according
to any of claims 6 to 9, further comprising:

image display means for displaying said min- 
utes data; 

correction input means for entering a correction
on the minutes data displayed by said image
display means; and

correction execution means for correcting said
minutes data, corresponding to the input data
of said correction input means.

11. A conference voice recording apparatus according 
to any of claims 6 to 10, further comprising: 

approval input means for entering an approval
for the minutes data displayed by said image
display means, for each identification data of
the speakers; and

approval addition means for adding the
approval, entered into said approval input
means, to said minutes data together with the
identification data of the speaker.

12. A conference voice recording apparatus according 
to claim 11, wherein said approval input means 
enters the approval for the minutes data in the form
of voice data by said voice input means. 

13. A conference voice recording apparatus according 
to any of claims 6 to 12, further comprising record 
printing means for printing said minutes data.

14. A conference voice recording apparatus according 
to any of claims 6 to 13, further comprising time 
detection means for detecting the current time at
the entry of the voice data of statement of the
speaker into said voice input means; 

wherein said record preparation means pre- 
pares the minutes data, also utilizing the time 
detected by said time detection means. 

15. An information memory medium storing software
readable by a computer to which are connected
vocalization memory means for registering the
vocalization data for each identification data of plu-
ral speakers, and voice input means for entering the
voice data of statements of the plural speakers;

wherein said memory medium stores a pro-
gram for causing said computer to execute:

recognition of the speaker and the content of
statement of the voice data entered into said
voice input means, based on the vocalization
data and identification data registered in said
vocalization memory means; and

preparation of minutes data of the recognized
content of statements, edited in a predeter-
mined form together with the identification data
of the speakers.

16. An information memory medium storing software
readable by a computer to which are connected
vocalization memory means for registering the
vocalization data for each identification data of plu-
ral speakers, and voice input means for entering the
voice data of statements of the plural speakers;

wherein said memory medium stores a pro-
gram for causing said computer to execute:

reception of the input of the identification data
of the speaker making a statement;

selection of the vocalization data of the
speaker from said vocalization memory means,
corresponding to the entered identification
data;

recognition of the content of statement of the
voice data entered into said voice input means,
based on the selected vocalization data; and

formation of minutes data of the recognized
content of statements, edited in a predeter-
mined form together with the entered identifica-
tion data.

17. A voice-character conversion output system com- 
prising: 

vocalization memory means for registering 
vocalization data for each of plural identification 
data; 

voice input means for entering data of the plural 
voices; 

voice recognition means for recognizing the 
voices entered into said voice input means, 
based on the vocalization data and the identifi- 
cation data registered in said vocalization 
memory means; and 

record forming means for forming character 
data of a content of the voice recognized by 
said voice recognition means, edited in a pre-
determined form together with the identification
data.

18. A speech processing system comprising: 

means for storing identification data of plural
speakers;

means for storing speech recognition reference
models for each of the plural speakers;

means for receiving voice data of statements
made by one or more of said plural speakers;

means for comparing said voice data with said
identification data to determine the one or more
speakers in the received voice data;

means for recognising the content of said
statements by comparing said voice data with
the speech recognition reference models for
the identified speakers; and

means for generating minutes data of the rec-
ognised content of said statements.

19. A storage medium storing processor implementa- 
ble instructions for controlling the processor to 
implement the method of any one of claims 1 to 5. 

20. Processor implementable instructions for control-
ling a processor to implement the method of any
one of claims 1 to 5. 
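
For illustration only, the flow recited in claims 7, 11 and 14 (entry of identification data, selection of the registered vocalization data, recognition, and formation of time-stamped minutes data with approvals) might be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: the names MinutesEntry, MinutesRecorder, add_statement, approve and format_minutes are hypothetical, the recognizer is a caller-supplied placeholder, and the "[time] speaker: text" layout is merely one possible predetermined form.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import Callable, Dict, List, Optional


@dataclass
class MinutesEntry:
    """One recognized statement (hypothetical record layout)."""
    speaker_id: str
    text: str
    time: datetime
    approved_by: List[str] = field(default_factory=list)


@dataclass
class MinutesRecorder:
    # Stand-in for the vocalization memory means:
    # identification data -> registered vocalization data.
    vocalization_memory: Dict[str, dict]
    # Stand-in for the voice recognition means (assumption):
    # (voice data, selected vocalization data) -> recognized text.
    recognize: Callable[[bytes, dict], str]
    entries: List[MinutesEntry] = field(default_factory=list)

    def add_statement(self, speaker_id: str, voice_data: bytes,
                      now: Optional[datetime] = None) -> MinutesEntry:
        # Select the vocalization data matching the entered identification data.
        vocalization = self.vocalization_memory[speaker_id]
        # Recognize the content of the statement using the selected data.
        text = self.recognize(voice_data, vocalization)
        # Record the time detected at entry of the voice data.
        entry = MinutesEntry(speaker_id, text, now or datetime.now())
        self.entries.append(entry)
        return entry

    def approve(self, index: int, speaker_id: str) -> None:
        # Add an approval together with the approver's identification data.
        if speaker_id not in self.vocalization_memory:
            raise KeyError(f"unregistered speaker: {speaker_id}")
        self.entries[index].approved_by.append(speaker_id)

    def format_minutes(self) -> str:
        # Edit the entries into one possible predetermined form.
        return "\n".join(
            f"[{e.time:%H:%M:%S}] {e.speaker_id}: {e.text}"
            for e in self.entries
        )
```

In this sketch a real recognizer would replace the placeholder callable. Passing the time explicitly to add_statement keeps the output reproducible.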